Active Ranking in Practice: General Ranking Functions with Sample Complexity Bounds

نویسندگان

Kevin G. Jamieson

Robert D. Nowak

چکیده

This paper examines the problem of ranking a collection of objects using pairwise comparisons (rankings of two objects). In a companion paper in the regular NIPS 2011 program [1], we showed that if each object x ∈ Rd is assigned a score f(x) = ||x − r|| for some unknown r ∈ Rd, then our recently proposed active ranking algorithm can recover the ranking of the scores using about d log n selectively chosen pairwise comparisons. Here we show that this same model contains all functions of the type g(x) = wTx for some unknown w ∈ Rd, thus the same bound applies. We take advantage of this fact and use kernel methods to represent more general ranking functions. This extension includes popular ranking methods such as RankSVM, and we derive nontrivial query complexity bounds for active versions of such algorithms. The efficacy of the theory and method are demonstrated by applying our kernelized adaptive algorithm to two real datasets. 1 Problem statement Given a set of n objects Θ := {θ1, . . . , θn}, we wish to discover how an oracle ranks these objects. The ranking, denoted by σ, can be thought of as a mapping σ : {1, . . . , n} →{ 1, . . . , n} that prescribes an order σ({θi}i=1) := θσ(1) ≺ θσ(2) ≺ · · · ≺ θσ(n−1) ≺ θσ(n) (1) where θi ≺ θj means θi precedes, or is preferred to, θj in the oracle’s ranking. The ranking can be learned by querying the oracle for pairwise comparisons of objects. The primary objective here is to bound the number of pairwise comparisons needed to correctly determine the ranking when the objects (and hence rankings) satisfy certain known structural constraints. We define a ranking function to be f : Θ→ R such that θi ≺ θj ⇐⇒ f(θi) < f(θj). (2) We say two ranking functions f and g are equivalent if both ranking functions correspond to the same ranking σ. In general, there are n! ways to permute n objects and we can always find an f that obeys (2) for any desired permutation. However, we assume that the oracle’s ranking function belongs to a certain class denoted by F , which may limit the set of possible rankings. Given a set of objects Θ and a ranking function class F , we denote this constrained set of possible rankings by ΣΘ,F . While F may be uncountably infinite, because of the equivalence of ranking functions, ΣΘ,F is a subset of Sn (symmetric group over n objects) and so its cardinality |ΣΘ,F | is at most n!. 2 Main theoretical results We proposed an active approach to learning rankings in a companion paper in the NIPS 2011 conference [1]. In that paper, we show that if F := {f(θ) = ||φ(θ)− r||, r ∈ Rd} where φ : Θ → Rd is fixed and known, then we can discover a ranking selected uniformly at random from the set ΣΘ,F

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Active Learning for Top-K Rank Aggregation from Noisy Comparisons

We explore an active top-K ranking problem based on pairwise comparisons that are collected possibly in a sequential manner as per our design choice. We consider two settings: (1) top-K sorting in which the goal is to recover the top-K items in order out of n items; (2) top-K partitioning where only the set of top-K items is desired. Under a fairly general model which subsumes as special cases ...

متن کامل

Efficiency Evaluation and Ranking DMUs in the Presence of Interval Data with Stochastic Bounds

On account of the existence of uncertainty, DEA occasionally faces the situation of imprecise data, especially when a set of DMUs include missing data, ordinal data, interval data, stochastic data, or fuzzy data. Therefore, how to evaluate the efficiency of a set of DMUs in interval environments is a problem worth studying. In this paper, we discussed the new method for evaluation and ranking i...

متن کامل

A Statistical Convergence Perspective of Algorithms for Rank Aggregation from Pairwise Data

There has been much interest recently in the problem of rank aggregation from pairwise data. A natural question that arises is: under what sorts of statistical assumptions do various rank aggregation algorithms converge to an ‘optimal’ ranking? In this paper, we consider this question in a natural setting where pairwise comparisons are drawn randomly and independently from some underlying proba...

متن کامل

On Multiphase-Linear Ranking Functions

Multiphase ranking functions (MΦRFs) were proposed as a means to prove the termination of a loop in which the computation progresses through a number of “phases”, and the progress of each phase is described by a different linear ranking function. Our work provides new insights regarding such functions for loops described by a conjunction of linear constraints (single-path loops). We provide a c...

متن کامل

Multi-dimensional Rankings, Program Termination, and Complexity Bounds of Flowchart Programs

Proving the termination of a flowchart program can be done by exhibiting a ranking function, i.e., a function from the program states to a wellfounded set, which strictly decreases at each program step. A standard method to automatically generate such a function is to compute invariants for each program point and to search for a ranking in a restricted class of functions that can be handled wit...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2011

Active Ranking in Practice: General Ranking Functions with Sample Complexity Bounds

نویسندگان

چکیده

منابع مشابه

Active Learning for Top-K Rank Aggregation from Noisy Comparisons

Efficiency Evaluation and Ranking DMUs in the Presence of Interval Data with Stochastic Bounds

A Statistical Convergence Perspective of Algorithms for Rank Aggregation from Pairwise Data

On Multiphase-Linear Ranking Functions

Multi-dimensional Rankings, Program Termination, and Complexity Bounds of Flowchart Programs

عنوان ژورنال:

اشتراک گذاری